
Trends in Hearing

SAGE Publications

Preprints posted in the last 30 days, ranked by how well they match Trends in Hearing's content profile, based on 12 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.

1
Discrimination of spectrally sparse complex-tone triads in cochlear implant listeners

Augsten, M.-L.; Lindenbeck, M. J.; Laback, B.

2026-03-24 neuroscience 10.64898/2026.03.20.712905 medRxiv
Top 0.1%
6.8%

Cochlear implant (CI) users typically experience difficulties perceiving musical harmony due to a restricted spectro-temporal resolution at the electrode-nerve interface, resulting in limited pitch perception. We investigated how stimulus parameters affect discrimination of complex-tone triads (three-voice chords), aiming to identify conditions that maximize perceptual sensitivity. Six post-lingually deafened CI listeners completed a same/different task with harmonic complex tones, while spectral complexity, voice(s) containing a pitch change, and temporal synchrony (simultaneous vs. sequential triad presentation) were manipulated. CI listeners discriminated harmonically relevant one-semitone pitch changes within triads when spectral complexity was reduced to three or five components per voice, with significantly better performance for three-component compared to nine-component tones. Sensitivity was observed for pitch changes in the high voice or in both high and low voices, but not for changes in only the low voice. Single-voice sensitivity predicted simultaneous-triad sensitivity when controlling for spectral complexity and voice with pitch change. Contrary to expectations, sequential triad presentation did not improve discrimination. An analysis of processor pulse patterns suggests that difference-frequency cues encoded in the temporal envelope rather than place-of-excitation cues underlie perceptual triad sensitivity. These findings support reducing spectral complexity to enhance chord discrimination for CI users based on temporal cues.

2
Can Multimodal Large Language Models Visually Interpret Auditory Brainstem Responses?

Jedrzejczak, W.; Kochanek, K.; Skarzynski, H.

2026-04-17 otolaryngology 10.64898/2026.04.15.26350944 medRxiv
Top 0.1%
6.4%

Introduction: Auditory brainstem response (ABR) is a standard objective method for estimating hearing threshold, especially in patients who cannot reliably participate in behavioral audiometry. However, ABR interpretation is usually performed by an expert. This study evaluated whether two general-purpose artificial intelligence (AI) multimodal large language model (LLM) chatbots, ChatGPT and Qwen, can accurately estimate ABR hearing thresholds from ABR waveform images. The accuracy was measured by comparisons with the judgements of 3 expert audiologists. Methods: A total of 500 images each containing several ABR waveforms recorded at different stimulus intensities were analyzed. Three expert audiologists established the reference auditory thresholds based on visual identification of wave V at the lowest stimulus intensity, with the most frequent judgment among the three used as the reference. Each waveform image was independently submitted to ChatGPT (version 5.1) and Qwen (version 3Max) using the same standardized prompt and without additional clinical context. Agreement with the expert thresholds was assessed as mean errors and correlations. Sensitivity and specificity for detecting hearing loss (>20 dB nHL) were also calculated. In cases where the AI and expert thresholds nominally matched, corresponding latency measures were also compared. Results: Auditory thresholds derived from both LLMs correlated strongly with expert opinion, with Pearson r = 0.954 for ChatGPT and r = 0.958 for Qwen. ChatGPT showed a mean error of +5.5 dB and Qwen showed a mean error of -2.7 dB. Exact nominal agreement with expert values was achieved in 34.6% of ChatGPT estimates and 35.6% of Qwen estimates; agreement within +/-10 dB was observed in 75.6% and 80.0% of cases, respectively. For hearing-loss classification, ChatGPT achieved 100% sensitivity but low specificity (20.4%), whereas Qwen showed a more balanced profile with 91.6% sensitivity and 67.5% specificity. Curiously, estimates of wave V latency were markedly poor for both LLMs, with systematic underestimation and weak correlations with the expert judgements. Conclusion: ChatGPT and Qwen demonstrated a moderate ability to estimate ABR thresholds from waveform images, although their performance was not good enough for independent clinical use. Both models captured general patterns of hearing loss severity, but there was systematic bias, limited specificity and sensitivity balance, and poor latency estimation. General-purpose multimodal LLMs may have potential as assistive or preliminary tools, but clinically reliable ABR interpretation will likely require specialized, domain-trained AI systems with expert oversight.
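The agreement statistics reported here (mean signed error, Pearson correlation, exact and within-10 dB agreement, and sensitivity/specificity at the >20 dB nHL criterion) are straightforward to reproduce from paired threshold estimates. Below is a minimal Python sketch, not the authors' code; the function and variable names are hypothetical.

```python
# Illustrative sketch (not the authors' code): agreement metrics between
# model-derived and expert ABR thresholds. Variable names are hypothetical.
import numpy as np
from scipy.stats import pearsonr

def agreement_metrics(expert_db, model_db, loss_cutoff_db=20.0):
    """Compare model-derived ABR thresholds (dB nHL) against expert reference values."""
    expert_db = np.asarray(expert_db, dtype=float)
    model_db = np.asarray(model_db, dtype=float)

    errors = model_db - expert_db                 # signed error per record
    r, _ = pearsonr(expert_db, model_db)          # Pearson correlation

    # Hearing-loss classification at the >20 dB nHL criterion used in the study
    expert_loss = expert_db > loss_cutoff_db
    model_loss = model_db > loss_cutoff_db
    tp = np.sum(model_loss & expert_loss)
    tn = np.sum(~model_loss & ~expert_loss)
    fp = np.sum(model_loss & ~expert_loss)
    fn = np.sum(~model_loss & expert_loss)

    return {
        "mean_error_db": errors.mean(),
        "pearson_r": r,
        "exact_agreement": np.mean(errors == 0),
        "within_10_db": np.mean(np.abs(errors) <= 10),
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }
```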

3
Modeling the Influence of Bandwidth and Envelope on Categorical Loudness Scaling

Neely, S. T.; Harris, S. E.; Hajicek, J. J.; Petersen, E. A.; Shen, Y.

2026-04-01 neuroscience 10.64898/2026.03.30.715393 medRxiv
Top 0.1%
4.8%

In a loudness-matching paradigm, a reduction in the loudness of sounds with bandwidths less than one-half octave compared to a tone of equal sound pressure level has been observed previously for five-tone complexes at 60 dB SPL centered at 1 kHz. Here, this loudness-reduction phenomenon is explored using band-limited noise across wide ranges of frequency and level. Additionally, these measurements are simulated by a model of loudness judgement based on neural ensemble averaging (NEA), which serves as a proxy for central auditory signal processing. Multi-frequency equal-loudness contours (ELC) were measured for each of the adult participants (N=100) with pure-tone average (PTA) thresholds that ranged from normal to moderate hearing loss using a categorical-loudness-scaling (CLS) paradigm. Presentation level and center frequency of the test stimuli were determined on each trial according to a Bayesian adaptive algorithm, which enabled multi-frequency ELC estimation within about five minutes of testing. Three separate test conditions differed by stimulus type: (1) pure-tone, (2) quarter-octave noise and (3) octave noise. For comparison, loudness judgements for all three stimulus types were also simulated by the NEA model, which comprised a nonlinear, active, time-domain cochlear model with an appended stage of neural spike generation. Mid-bandwidth loudness reduction was observed to be greatest at moderate stimulus levels and frequencies near 1 kHz. This feature was approximated by the NEA model, which suggests involvement of an early stage of the central auditory system in the formation of loudness judgements.

4
Improving Automated Diagnosis of Middle and Inner Ear Pathologies by Estimating Middle Ear Input Impedance from Wideband Tympanometry

Kamau, A. F.; Merchant, G. R.; Nakajima, H. H.; Neely, S. T.

2026-03-31 otolaryngology 10.64898/2026.03.26.26349034 medRxiv
Top 0.1%
3.6%

Conductive hearing loss (CHL) with a normal otoscopic exam can be difficult to diagnose because routine clinical measures such as audiometric air-bone gaps (ABGs) can identify a conductive component but often cannot distinguish among specific underlying mechanical pathologies (e.g., stapes fixation versus superior canal dehiscence, which may produce similar audiograms). Wideband tympanometry (WBT) is a fast, noninvasive test that can provide additional mechanical information across a broad range of frequencies (200 Hz to 8 kHz). However, WBT metrics are influenced by variations in ear canal geometry and probe placement and can be challenging to interpret clinically. In this study, we extend prior WBT absorbance-based classification work by estimating the middle ear input impedance at the tympanic membrane (ZME), a WBT-derived metric intended to reduce ear canal effects. To estimate ZME, we fit an analog circuit model of the ear canal, middle ear, and inner ear to raw WBT data collected at tympanometric peak pressure (TPP). Data from 27 normal ears, 32 ears with superior canal dehiscence, and 38 ears with stapes fixation were analyzed. A multinomial logistic regression classifier was trained using principal component analysis (retaining 90% variance) and stratified 5-fold cross-validation with regularization. We compared feature sets based on ABGs alone, ABGs combined with absorbance, and ABGs combined with the magnitude of ZME. The combination of ABGs and the magnitude of ZME produced the best performance, achieving an overall accuracy of 85.6% compared to 80.4% for ABGs alone and 78.4% for ABGs combined with absorbance. These results suggest that incorporating model-derived middle ear impedance features with standard audiometric measures (ABGs) can improve automated pathology classification for stapes fixation and superior canal dehiscence.
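The classification pipeline described (PCA retaining 90% of variance feeding a regularized multinomial logistic regression, evaluated with stratified 5-fold cross-validation) maps directly onto standard tooling. The sketch below is an illustration using scikit-learn on synthetic placeholder data, not the authors' implementation; the feature layout and hyperparameters are assumptions.

```python
# Illustrative sketch of the classification pipeline described above:
# PCA (90% variance) + regularized multinomial logistic regression,
# stratified 5-fold cross-validation. Data are random placeholders.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

# X: e.g. ABGs concatenated with |ZME| across frequency; y: 0=normal, 1=SCD, 2=stapes fixation
rng = np.random.default_rng(0)
X = rng.normal(size=(97, 40))       # hypothetical feature matrix (97 ears x 40 features)
y = rng.integers(0, 3, size=97)     # hypothetical class labels

clf = make_pipeline(
    StandardScaler(),
    PCA(n_components=0.90),                       # keep components explaining 90% of variance
    LogisticRegression(max_iter=5000, C=1.0),     # L2-regularized multinomial regression
)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(clf, X, y, cv=cv)
print(f"Mean cross-validated accuracy: {scores.mean():.3f}")
```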

5
Speech-in-Noise Difficulties in Aminoglycoside Ototoxicity Reflects Combined Afferent and Efferent Dysfunction

Motlagh Zadeh, L.; Izhiman, D.; Blankenship, C. M.; Moore, D. R.; Martin, D. K.; Garinis, A.; Feeney, P.; Hunter, L. R.

2026-03-26 otolaryngology 10.64898/2026.03.23.26348719 medRxiv
Top 0.1%
2.7%

Objectives: Patients with cystic fibrosis (CF) often receive aminoglycosides (AGs) to manage recurrent pulmonary infections, placing them at risk for ototoxicity. Chronic AG use can lead to complex cochlear damage affecting inner and outer hair cells, the stria vascularis, and spiral ganglion neurons. The greatest damage is typically in the basal cochlear region, which encodes high-frequency hearing, with additional involvement of more apical regions. While extended-high-frequency (EHF) hearing loss (EHFHL; 9-16 kHz) is often the earliest sign of AG ototoxicity, speech-in-noise (SiN) effects are rarely studied. Our overall hypothesis is that SiN perception difficulties in individuals with CF treated with AGs are related to combined cochlear and neural damage, primarily in the EHF range but also in the standard frequency (SF; 0.25-8 kHz) range. Three mechanisms that contribute to SiN perception were evaluated in children and young adults: 1) a primary effect of reduced EHF sensitivity, measured by pure-tone audiometry (PTA) and transient-evoked otoacoustic emissions (TEOAEs); 2) a secondary effect of subclinical damage in the SF range, measured by PTA and TEOAEs; and 3) additional neural effects, measured by middle ear muscle reflex (MEMR) thresholds (afferent) and growth functions (efferent). Design: A total of 185 participants were enrolled: 101 individuals with CF treated with intravenous AGs and 84 age- and sex-matched controls without hearing concerns or CF. Assessments included EHF and SF PTA; the Bamford-Kowal-Bench (BKB)-SIN test for SiN perception; double-evoked TEOAEs with chirp stimuli from 0.71 to 14.7 kHz; and ipsilateral and contralateral wideband MEMR thresholds and growth functions using broadband stimuli. Results: Reduced sensitivity at EHFs (PTA, TEOAEs) was not associated with impaired SiN perception in the CF group. SF hearing, regardless of EHF status, was the primary predictor of SiN performance in the CF group. Increased MEMR growth was also significantly associated with poorer SiN in the CF group. Conclusions: In CF, impaired SiN perception was primarily predicted by SF hearing impairment, with additional involvement of the efferent auditory pathway through increased MEMR growth. These results build on prior evidence for efferent neural effects of ototoxic exposures, supporting both sensory (afferent) and neural (efferent) mechanisms that contribute to listening difficulties in CF. Thus, preventive and intervention strategies should consider these combined mechanisms in people with AG ototoxicity to address their SiN problems.

6
Acoustic Salience Drives Pupillary Dynamics in an Interrupted, Reverberant Task

Figarola, V.; Liang, W.; Luthra, S.; Parker, E.; Winn, M.; Brown, C.; Shinn-Cunningham, B. G.

2026-04-02 neuroscience 10.64898/2026.03.31.715639 medRxiv
Top 0.1%
1.9%

Listeners face many challenges when trying to maintain attention to a target source in everyday settings; for instance, reverberation distorts acoustic cues and interruptions capture attention. However, little is known about how these challenges affect the ability to maintain selective attention. Here, we measured syllable recall accuracy and pupil dilation during a spatial selective attention task that was sometimes disrupted. Participants heard two competing, temporally interleaved syllable streams presented in pseudo-anechoic or reverberant environments. On randomly selected trials, a sudden interruption occurred mid-sequence. Compared to anechoic trials, reverberant performance was worse overall, and the interrupter disrupted performance. In uninterrupted trials, reverberation reduced peak pupil dilation both when it was consistent across all stimuli in a block and when it was randomized trial to trial, suggesting temporal smearing reduced clarity of the scene and the salience of events in the ongoing streams. Pupil dilations in response to interruptions indicated perceptual salience was strong across reverberant and anechoic conditions. Specifically, baseline pupil size before trials did not vary across room conditions, and mixing or blocking of trials (altering stimulus expectations) had no impact on pupillary responses. Together, these findings highlight that stimulus salience drives cognitive load more strongly than does task performance.

7
A Blinded Comparative Evaluation of Clinical and AI-Generated Responses to Otologic Patient Queries

Akinniyi, S.; Jain-Poster, K.; Evangelista, E.; Yoshikawa, N.; Rivero, A.

2026-04-15 otolaryngology 10.64898/2026.04.14.26350677 medRxiv
Top 0.1%
1.0%

Objective: The objective of this study is to assess the quality, empathy, and readability of large language model (LLM) responses to otologic questions from patients, compared with verified physician responses in patient-driven forums. This study aims to predict the potential utility of LLMs in patient-centered communication. Study Design: Comparative study. Setting: Internet. Methods: A sample of 49 otology-related questions posted on Reddit r/AskDocs between January 2020 and June 2025 was selected using search terms including "hearing loss," "ear infection," "tinnitus," "ear pain," and "vertigo." Posts were retrieved using Reddit's "Top" filter. Each question was answered by a verified doctor on Reddit and three AI LLMs (ChatGPT-4o, ClaudeAI, Google Gemini). Responses were scored by five evaluators. Results: Common otologic concerns posed in patient questions were otalgia (38.7%), vertigo (28.6%), tinnitus (24.5%), hearing loss (22.4%), and aural fullness (20.4%). LLM responses were longer than physician responses (mean 145 vs 67 words; p < .05) and rated higher in quality (10.95 vs 9.58), empathy (7.26 vs 5.18), and readability (4.00 vs 3.73) (all p < .05). Evaluators correctly identified AI versus physician responses in 89.4% of cases, with higher sensitivity for detecting physician responses (93.5%). By Flesch-Kincaid grade level, ChatGPT produced the most readable content (mean 7.25), while ClaudeAI responses were more complex (11.86; p < .05). Conclusion: LLM responses received higher ratings in quality, empathy, and readability than those of physicians across a variety of otologic concerns. When appropriately implemented, such systems may enhance access to understandable otologic information and complement clinician-delivered care.
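For readers unfamiliar with the Flesch-Kincaid grade level used above to compare readability, the metric is a fixed formula over average sentence length and syllables per word. The sketch below is a rough illustration; its syllable counter is a crude vowel-group heuristic rather than the dictionary-based counting used by production readability tools.

```python
# Rough sketch of the Flesch-Kincaid grade-level formula:
# 0.39 * (words/sentence) + 11.8 * (syllables/word) - 15.59.
# The syllable counter is an approximate heuristic, so values are indicative only.
import re

def count_syllables(word: str) -> int:
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))

def flesch_kincaid_grade(text: str) -> float:
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return 0.39 * (len(words) / len(sentences)) + 11.8 * (syllables / len(words)) - 15.59

print(flesch_kincaid_grade("Tinnitus is the perception of sound without an external source."))
```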

8
Investigating neural speech processing with functional near infrared spectroscopy: considerations for temporal response functions

Wilroth, J.; Sotero Silva, N.; Tafakkor, A.; de Avo Mesquita, B.; Ip, E. Y. J.; Lau, B. K.; Hannah, J.; Di Liberto, G. M.

2026-03-23 neuroscience 10.64898/2026.03.20.713212 medRxiv
Top 0.1%
0.8%

Functional near infrared spectroscopy (fNIRS) is increasingly used in hearing and communication research, with advantages such as robustness to movement artifacts, improved spatial resolution, and flexibility in the contexts in which it can be applied. At the same time, the field is progressively moving towards more continuous, naturalistic listening paradigms, resulting in the widespread adoption of speech-tracking analyses such as temporal response functions (TRFs) in electroencephalography (EEG) and magnetoencephalography (MEG) studies. However, it remains unclear whether these analyses can be applied to the slower haemodynamic signals measured by fNIRS. In the present study, we investigated whether a TRF framework can similarly be applied to fNIRS data recorded during continuous speech perception. Eight participants listened to speech simultaneously while fNIRS signals were acquired in a hyperscanning setup. Speech features were regressed onto the haemodynamic responses to test the feasibility and interpretability of fNIRS-based TRFs. Prediction correlations between observed and modelled fNIRS signals across speech features were higher than those typically reported for EEG-TRF studies and comparable to those reported for MEG-TRF studies. Moreover, these correlations did not overlap with a null distribution generated from trial-mismatched fNIRS data, confirming statistical significance, and were slightly greater than those obtained from a conventional GLM approach. Our findings support that TRF estimation can yield meaningful and statistically significant responses from fNIRS data. Highlights: TRF modelling can be meaningfully applied to fNIRS data acquired during speech listening tasks. Prediction correlations between actual and modelled fNIRS signals were above chance level, with values comparable to previous EEG/MEG studies. TRFs explained more fNIRS variance than a conventional GLM approach.
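A TRF in this sense is a set of regression weights mapping time-lagged copies of a speech feature (such as the envelope) onto a recorded signal. The sketch below illustrates the idea as plain ridge regression on synthetic data at an fNIRS-like sampling rate; the lag range, regularization, and data are illustrative assumptions, not the authors' pipeline.

```python
# Minimal sketch of TRF estimation as time-lagged ridge regression between a
# speech feature and one channel. Synthetic data; parameters are illustrative.
import numpy as np

def lagged_design(stimulus, lags):
    """Build a design matrix whose columns are the stimulus shifted by each lag."""
    n = len(stimulus)
    X = np.zeros((n, len(lags)))
    for j, lag in enumerate(lags):
        if lag >= 0:
            X[lag:, j] = stimulus[: n - lag]
        else:
            X[:lag, j] = stimulus[-lag:]
    return X

def fit_trf(stimulus, response, lags, lam=1.0):
    """Ridge-regression TRF: weights mapping the lagged stimulus to the response."""
    X = lagged_design(stimulus, lags)
    XtX = X.T @ X + lam * np.eye(X.shape[1])
    return np.linalg.solve(XtX, X.T @ response)

# Example with synthetic data at 10 Hz (haemodynamic signals are slow)
rng = np.random.default_rng(1)
envelope = rng.normal(size=600)                    # 60 s of a speech feature
true_trf = np.exp(-np.arange(0, 80) / 20.0)        # slow, HRF-like kernel
signal = np.convolve(envelope, true_trf)[:600] + rng.normal(scale=0.5, size=600)

lags = np.arange(0, 80)                            # 0-8 s of lags
w = fit_trf(envelope, signal, lags, lam=10.0)
pred = lagged_design(envelope, lags) @ w
print("Prediction correlation:", np.corrcoef(pred, signal)[0, 1].round(3))
```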

9
Hearing sounds when the eyes move: A case study implicating the tensor tympani in eye movement-related peripheral auditory activity

King, C. D.; Zhu, T.; Groh, J. M.

2026-03-25 neuroscience 10.64898/2026.03.24.713974 medRxiv
Top 0.1%
0.7%

Information about eye movements is necessary for linking auditory and visual information across space. Recent work has suggested that such signals are incorporated into processing at the level of the ear itself (Gruters, Murphy et al. 2018). Here we report confirmation that the eye movement signals that reach the ear can produce perceptual consequences, via a case report of an unusual participant with tensor tympani myoclonus who hears sounds when she moves her eyes. The sounds she hears could be recorded with a microphone in the ear in which she hears them (the left), and occurred for large leftward eye movements to extreme orbital positions of the eyes. The sounds elicited by this participant's eye movements were reminiscent of eye movement-related eardrum oscillations (EMREOs; Gruters, Murphy et al. 2018, Brohl and Kayser 2023, King, Lovich et al. 2023, Lovich, King et al. 2023, Lovich, King et al. 2023, Abbasi, King et al. 2025, Sotero Silva, Kayser et al. 2025, King and Groh 2026, Leon, Ramos et al. 2026, Sotero Silva, Brohl et al. 2026), but were larger and longer lasting than classical EMREOs, helping to explain why they were audible to her. Overall, the observations from this patient help establish that (a) eye movement-related signals specifically reach the tensor tympani muscle and that (b) when there is an abnormality involving that muscle, such signals can lead to actual audible percepts. Given that the tensor tympani contributes to the regulation of sound transmission in the middle ear, these findings support the conclusion that eye movement signals reaching the ear have functional consequences for auditory perception. The findings also expand the types of medical conditions that produce gaze-evoked tinnitus, to date most commonly observed in connection with acoustic neuromas.

10
BioDCASE: Using data challenges to make community advances in computational bioacoustics

Stowell, D.; Nolasco, I.; McEwen, B.; Vidana Vila, E.; Jean-Labadye, L.; Benhamadi, Y.; Lostanlen, V.; Dubus, G.; Hoffman, B.; Linhart, P.; Morandi, I.; Cazau, D.; White, E.; White, P.; Miller, B.; Nguyen Hong Duc, P.; Schall, E.; Parcerisas, C.; Gros-Martial, A.; Moummad, I.

2026-04-06 animal behavior and cognition 10.64898/2026.04.02.716062 medRxiv
Top 0.2%
0.5%

Computational bioacoustics has seen significant advances in recent decades. However, the rate of insights from automated analysis of bioacoustic audio lags behind our rate of collecting the data - due to key capacity constraints in data annotation and bioacoustic algorithm development. Gaps in analysis methodology persist: not because they are intractable, but because of resource limitations in the bioacoustics community. To bridge these gaps, we advocate the open science method of data challenges, structured as public contests. We conducted a bioacoustics data challenge named BioDCASE, within the format of an existing event (DCASE). In this work we report on the procedures needed to select and then conduct useful bioacoustics data challenges. We consider aspects of task design such as dataset curation, annotation, and evaluation metrics. We report the three tasks included in BioDCASE 2025 and the resulting progress made. Based on this we make recommendations for open community initiatives in computational bioacoustics.

11
Individualised evoked response detection based on the spectral noise colour

Undurraga Lucero, J. A.; Chesnaye, M.; Simpson, D.; Laugesen, S.

2026-04-13 health informatics 10.64898/2026.04.11.26350685 medRxiv
Top 0.2%
0.4%

Objective detection of evoked potentials (EPs) is central to digital diagnostics in hearing assessment and clinical neurophysiology, yet current approaches remain time-intensive and sensitive to inter-individual noise variability. Many existing detection methods rely on population-based assumptions or computationally demanding procedures, limiting robustness and efficiency in real-world clinical settings. We present Fmpi, a digital EP detection framework enabling individualised, real-time response detection through analytical modelling of the spectral colour and temporal dynamics of background noise within each recording. Using extensive simulations and large-scale human electroencephalography datasets spanning brainstem, steady-state, and cortical EPs recorded in adults and infants, we demonstrate performance comparable or superior to state-of-the-art bootstrapped methods while operating at a fraction of the computational cost and maintaining well-controlled sensitivity with improved specificity. Importantly, Fmpi incorporates a futility detection mechanism enabling early termination of uninformative recordings, reducing testing time without compromising diagnostic reliability.

12
Transformer Language Models Reveal Distinct Patterns in Aphasia Subtypes and Recovery Trajectories

Ahamdi, S. S.; Fridriksson, J.; Den Ouden, D.

2026-03-27 neuroscience 10.64898/2026.03.27.714240 medRxiv
Top 0.2%
0.3%

Language impairments in aphasia are characterized by various representational disruptions that may be reflected in discourse production. This research examines the capacity of transformer-based language models, particularly GPT-2, to serve as a computational framework for analyzing variations in aphasic narrative speech. A longitudinal dataset of narrative speech samples collected at six time points from individuals with aphasia (N = 47) was utilized as part of an intervention study. All transcripts were processed via the GPT-2 language model to obtain activation values from each of the 12 transformer layers. Statistically significant differences in activation magnitude across aphasia subtypes were found at every layer (all p < .001), with the most pronounced effects in the deeper layers. Pairwise Tukey HSD tests revealed consistent distinctions between Broca's aphasia and both Anomic and Wernicke's aphasia, suggesting a shared activation profile between the latter two. Longitudinal tests revealed significant changes over time, especially in the final three layers (10-12). These findings suggest that transformer-based activation patterns reflect meaningful variation in aphasic discourse and could complement current diagnostic tools. Overall, GPT-2 provides a scalable tool to model representational dynamics in aphasia and enhance the clinical interpretability of deep language models.
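Per-layer activations of the kind analyzed here can be extracted from GPT-2 with the Hugging Face transformers library. The sketch below (not the authors' code) computes a mean absolute hidden-state magnitude per transformer layer for a single transcript; the choice of summary statistic and the example sentence are assumptions.

```python
# Illustrative sketch: per-layer GPT-2 activation magnitudes for one transcript,
# using Hugging Face `transformers`. Not the authors' implementation.
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2", output_hidden_states=True)
model.eval()

def layer_activation_magnitudes(transcript: str):
    """Mean absolute hidden-state activation for each of the 12 transformer layers."""
    inputs = tokenizer(transcript, return_tensors="pt", truncation=True, max_length=1024)
    with torch.no_grad():
        outputs = model(**inputs)
    # hidden_states[0] is the embedding layer; [1:] are the 12 transformer layers
    return [h.abs().mean().item() for h in outputs.hidden_states[1:]]

print(layer_activation_magnitudes("the boy is reaching for the cookie jar"))
```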

13
Linguistic and Acoustic Biomarkers from Simulated Speech Reveal Early Cognitive Impairment Patterns in Alzheimer's Disease

Debnath, A.; Sarkar, S.

2026-04-08 neuroscience 10.64898/2026.04.08.717162 medRxiv
Top 0.2%
0.3%

Background: Alzheimer's disease (AD) causes progressive decline in language and cognition. Automated speech analysis has emerged as a promising screening tool, yet clinical data scarcity limits progress. To address this, we generated a large-scale simulated speech dataset to model linguistic and acoustic deterioration across cognitive stages: Control, Mild Cognitive Impairment (MCI), and AD. Methods: Using Monte Carlo simulations, we emulated the Pitt DementiaBank "Cookie Theft" narratives. Acoustic features (speech rate, pause duration, jitter, shimmer) and linguistic features (type-token ratio, unique-word count, filler usage) were synthetically sampled from real-world DementiaBank distributions. We trained an XGBoost classifier to distinguish diagnostic groups, and applied SHAP (Shapley Additive exPlanations) to assess feature importance. Results: The model achieved high discriminative performance (AUC ≈ 0.94; accuracy ≈ 85%). Compared to controls, simulated MCI and AD groups showed progressive declines in fluency and lexical diversity, and increases in disfluencies and voice instability. SHAP analysis revealed that key predictors included reduced type-token ratio, higher pause and filler rates, and elevated jitter/shimmer. Classification was most accurate for Control vs. AD; MCI misclassifications highlighted intermediate profiles. Interpretation: Our framework, FMN (Forget Me Not), captures clinically relevant speech changes using simulated data, offering an explainable and scalable approach for cognitive screening. While not a substitute for real datasets, FMN validates a pipeline that mirrors known AD markers and can guide future real-world deployments. External validation remains a key next step for translational impact.
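The modeling step (a gradient-boosted classifier explained with SHAP values) can be sketched as follows. The example simplifies to a binary Control vs. AD contrast on synthetic placeholder features whose names mirror those listed in the abstract; the distributions, labels, and hyperparameters are assumptions, not the DementiaBank-derived data.

```python
# Sketch of the XGBoost + SHAP workflow described above, on synthetic placeholder
# data with a binary Control-vs-AD label. Not the authors' implementation.
import numpy as np
import pandas as pd
import xgboost as xgb
import shap

rng = np.random.default_rng(42)
features = ["speech_rate", "pause_duration", "jitter", "shimmer",
            "type_token_ratio", "unique_words", "filler_rate"]
n = 300
X = pd.DataFrame(rng.normal(size=(n, len(features))), columns=features)
y = rng.integers(0, 2, size=n)   # 0 = Control, 1 = AD (placeholder labels)

model = xgb.XGBClassifier(n_estimators=200, max_depth=4, learning_rate=0.1)
model.fit(X, y)

explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)
if isinstance(shap_values, list):        # some SHAP versions return one array per class
    shap_values = shap_values[1]

# Mean absolute SHAP value per feature as a global importance ranking
importance = np.abs(shap_values).mean(axis=0)
for name, score in sorted(zip(features, importance), key=lambda t: -t[1]):
    print(f"{name}: {score:.4f}")
```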

14
Human brains implicitly and rapidly distinguish AI from human voices before decoding prosodic meaning

Chen, W.; Pell, M.; Jiang, X.

2026-04-09 neuroscience 10.64898/2026.04.08.716483 medRxiv
Top 0.3%
0.1%

People encounter AI voices daily. Existing behavioral studies suggest listeners rely on prosodic cues such as intonation and expressiveness to detect audio deepfakes, reporting that AI voices sound prosodically less rich than human voices. To test whether prosodic processing drives deepfake discrimination in the neural time course of voice processing, we recorded electroencephalographic (EEG) data while participants listened to human and AI-generated speakers producing utterances in confident vs. doubtful prosody (tone of voice), with attention directed toward memorizing speaker names. We used voice cloning to control for speaker identity confounds between human and AI voices. Multivariate pattern analysis revealed that neural discrimination of human vs. AI voices emerged rapidly regardless of prosody (confident: 176 ms; doubtful: 134 ms), substantially preceding prosody discrimination (confident vs. doubtful within human voices: 2066 ms; within AI voices: 1366 ms). Acoustic analysis confirmed that prosodic distinctions became classifiable only at utterance offset (90% normalized duration), converging with neural evidence that prosody requires near-complete temporal integration. This temporal dissociation between rapid voice source discrimination and late-emerging prosody decoding suggests that prosody plays a smaller role in audio deepfake detection than listeners retrospectively report. Representational similarity analysis further revealed that spectral envelope features (mel-frequency cepstral coefficients; MFCC), rather than the visually salient high-frequency energy differences, drove neural human-AI discrimination, with MFCCs' earliest independent contribution (228 ms) closely following the MVPA decoding onset (134-176 ms). Future studies may manipulate specific acoustic components to establish the causal sources of this rapid and sustained neural discrimination. Significance Statement: People encounter AI voices daily, in phone calls, navigation apps, supermarket checkouts, and subway announcements. Using electroencephalography, we show that the human brain automatically and rapidly distinguishes everyday AI voices from human speech, even without conscious attention to voice source. Although people may attribute this ability to AI voices sounding monotone or prosodically unnatural, the brain relies on subtler acoustic signatures, enabling discrimination before prosodic information becomes available. Attempts to identify the specific acoustic features driving this neural detection were inconclusive, pointing to the need for future causal investigations. We encourage engineers and policymakers to ensure AI voices remain perceptually detectable, as increasingly humanlike AI voices could cognitively disadvantage the general public if they become indistinguishable from human speech.

15
Sparse Stimulus Generation Improves Reverse Correlation Efficiency and Interpretability

Gargano, J. A.; Rice, A.; Chari, D. A.; Parrell, B.; Lammert, A. C.

2026-03-26 neuroscience 10.64898/2026.03.24.714012 medRxiv
Top 0.3%
0.1%

Reverse correlation is a widely used and well-established method for probing latent perceptual representations in which subjects render subjective preference responses to ambiguous stimuli. Stimuli are purposefully designed to have no direct relationship with the target representation (e.g., they are randomly generated), a property which makes each individual stimulus minimally informative toward reconstructing the target, and often difficult for subjects to interpret. As a result, a large number of stimulus-response pairs must be gathered from a given subject in order for reconstructions to be of sufficient quality, making the task fatiguing. Recent work has demonstrated that the number of trials needed can be substantially reduced using a compressive sensing framework that incorporates into the reconstruction process the assumption that the target representation can be sparsely represented in some basis. Here, we introduce an alternative method that incorporates the sparsity assumption directly into stimulus generation, which holds promise not only for improving efficiency, but also for improving the interpretability of stimuli from the subjects' perspective. We develop this new method as a mathematical variation of the compressive sensing approach, before conducting one simulation study and two human subjects experiments to assess the benefits of this method for reconstruction quality, sample-size efficiency, and subjective interpretability. Results show that sparse stimulus generation improves all three of these areas relative to conventional reverse correlation approaches, and also relative to compressive sensing in most conditions.
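As background for the method being improved, conventional reverse correlation estimates the latent template as a response-weighted average of the random stimuli. The sketch below simulates that baseline with a hypothetical template and observer; it does not implement the sparse stimulus generation or compressive sensing approaches the preprint studies.

```python
# Minimal sketch of conventional reverse correlation: the latent template is
# estimated as the mean of stimuli judged "yes" minus the mean judged "no".
# The template and simulated observer are placeholders, not from the preprint.
import numpy as np

rng = np.random.default_rng(7)
dim, n_trials = 64, 2000
template = np.sin(np.linspace(0, np.pi, dim))      # hypothetical internal representation

stimuli = rng.normal(size=(n_trials, dim))         # random (non-sparse) stimuli
# Simulated observer: responds "yes" when the noisy projection onto the template is positive
responses = (stimuli @ template + rng.normal(scale=2.0, size=n_trials)) > 0

# Classification-image estimate
estimate = stimuli[responses].mean(axis=0) - stimuli[~responses].mean(axis=0)
estimate /= np.linalg.norm(estimate)
print("Correlation with true template:",
      np.corrcoef(estimate, template / np.linalg.norm(template))[0, 1].round(3))
```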

16
Domain Specific Functional Plasticity of Visual Processing Constrained by General Cognitive Ability in Deaf Individuals

Dong, C.; Wang, Z.; Zuo, X.; Wang, S.

2026-03-26 neuroscience 10.64898/2026.03.25.714101 medRxiv
Top 0.3%
0.1%

Interpersonal communication relies on integrating facial and vocal signals to extract multidimensional communicative information. How the absence of audition reshapes the communicative system remains unclear. We compared the performance of deaf (N=136) and hearing (N=135) adults across multiple domains (facial identity, emotional expression, speech, and global motion) through a series of unisensory and audiovisual psychophysical tasks. The results showed that, in hearing individuals, reliance on facial versus vocal signals differed across domains. In deaf individuals, auditory deprivation did not produce uniform enhancement or impairment of visual processing. Instead, they exhibited reduced sensitivity to dynamic emotional expressions and global motion, preserved sensitivity to facial identity (both static and dynamic) and static expressions, and enhanced categorization of facial speech. Notably, sensitivity to dynamic facial expressions and global motion was correlated, and both were explained by variations in fluid intelligence. Our results provide a systematic characterization of visual function across domains in deaf individuals, suggesting that the consequences of hearing loss are shaped both by the functional roles of audition within each domain and by broader cognitive adaptations. These findings advance understanding of cross-modal plasticity and inform the development of targeted, ecologically valid accessibility and sensory-substitution strategies.

17
Rites of Passage: Professional Identity Formation and the OTOHNS Oral Board Exam

McMains, K.

2026-03-19 otolaryngology 10.64898/2026.03.19.26347858 medRxiv
Top 0.4%
0.0%

Objectives: Professional Identity Formation has been defined as an individual internalizing the values and norms of the medical profession in ways that result in thinking, acting, and feeling like a physician. During the COVID-19 pandemic, the ABOHNS pivoted the format of the oral board exam from in-person exams to virtually administered exams. In light of this, we ask: (1) How, if at all, do Otolaryngology-Head and Neck Surgery Oral Board Examinations shape examinee professional identity? (2) Do different formats of administering Otolaryngology-Head and Neck Surgery Oral Board Examinations have different effects on examinee Professional Identity Formation (PIF)? Methods: Thematic analysis was used to explore candidate experience. We developed and tested a shortened Professional Identity Essay that foregrounds the PIF effects resulting from differing methods of administering the Oral Board Examination. Themes generated from semi-structured interviews were compared to identify differences in Professional Identity resulting from OBEs. Results: Nineteen participants enrolled in our study, each completing a single interview lasting between 15 and 30 minutes. We found participants' responses to coalesce around three themes: the educational effect of the OBE on PIF; distinct stresses carried by different OBE formats; and the catalytic effect on PIF of the in-person OBE. Conclusions: Participating in either format of the ABOHNS OBE demonstrated an educational effect on PIF. Additionally, when delivered in an in-person format, the ABOHNS OBE also catalyzed ongoing PIF. This effect of the OBE offers an additional potent mechanism to integrate the most inclusive range of candidates into the community of Otolaryngology practice. Level of Evidence: VI (single qualitative study investigating perspectives of healthcare providers on a specific intervention).

18
Deficits in tail-lift and air-righting reflexes in rats after ototoxicity associate with loss of vestibular type I hair cells

Palou, A.; Tagliabue, M.; Beraneck, M.; Llorens, J.

2026-03-26 neuroscience 10.64898/2026.03.24.712950 medRxiv
Top 0.4%
0.0%

The rat vestibular system plays a critical role in anti-gravity responses such as the tail-lift reflex and the air-righting reflex. In a previous study in male rats, we obtained evidence that these two reflexes depend on the function of non-identical populations of vestibular sensory hair cells (HC). Here, we caused graded lesions in the vestibular system of female rats by exposing the animals to several different doses of an ototoxic chemical, 3,3'-iminodipropionitrile (IDPN). After exposure, we assessed the anti-gravity responses of the rats and then assessed the loss of type I HC (HCI) and type II HC (HCII) in the central and peripheral regions of the crista, utricle, and saccule. As expected, we recorded a dose-dependent loss of vestibular function and loss of HCs. The relationship between hair cell loss and functional loss was examined using non-linear models fitted by orthogonal distance regression. The results indicated that both the tail-lift and air-righting reflexes mostly depend on HCI function. However, the two reflexes differed in the epithelium on which they depend: while the tail-lift response is sensitive to loss of crista and/or utricle HCIs, the air-righting response depends rather on utricular and/or saccular integrity.

19
Vestibular Function Loss Associates With Sensory Epithelium Pathology In Vestibular Schwannoma Patients

Borrajo, M.; Callejo, A.; Castellanos, E.; Amilibia, E.; Llorens, J.

2026-03-25 neuroscience 10.64898/2026.03.23.713132 medRxiv
Top 0.4%
0.0%

Vestibular schwannomas (VS) cause vestibular function loss by mechanisms that are still poorly understood. We evaluated the vestibulo-ocular reflex by the video-assisted Head Impulse Test (vHIT) in patients with planned tumour resection by a trans-labyrinthine approach. The vestibular sensory epithelia were collected and processed by immunofluorescent labelling for confocal microscopy analysis of sensory hair cell subtypes (type I, HCI, and type II, HCII), calyx endings of the pure-calyx afferents, and the calyceal junction normally found between HCI and the calyx (n=23). Comparing Normofunction and Hypofunction patients, we concluded that worse vestibular function associates with decreased HCI and HCII counts in the sensory epithelia and with an increased proportion of damaged calyces. A decrease in the numbers of HCI and of calyx endings of the pure-calyx afferents was found to be associated with increasing age. Partial least squares regression (PLSR) models indicated that VS and age had independent, additive effects on vestibular function. Correlation analyses indicated that lower vHIT gains associate with lower numbers of HCI and increased percentages of damaged calyces. These data support the hypothesis that the deleterious effect of VS on vestibular function is mediated, at least in part, by its damaging impact on the vestibular sensory epithelium. They also provide further evidence for the dependency of the vestibulo-ocular reflex on HCI function and for calyceal junction pathology as a common response of the sensory epithelium to HC stress.

20
Negative emotional visual stimuli alter specific improvised dance biomechanics in professional dancers

Maracia, B. C. B.; Souza, T. R.; Oliveira, G. S.; Nunes, J. B. P.; dos Santos, C. E. S.; Peixoto, C. B.; Lopes-Silva, J. B.; Nobrega, L. A. O. d. A.; Araujo, P. A. d.; Souza, R. P.; Souza, B. R.

2026-03-20 neuroscience 10.64898/2026.03.18.711707 medRxiv
Top 0.4%
0.0%

Dance is a core form of human-environment interaction and a powerful medium for emotional expression, yet dancers are routinely exposed to environmental affective cues that may shape their movement. We tested whether a negative emotional context induced immediately before improvisation alters dance biomechanics. Twenty professional dancers performed two 3-min improvised dances. Between dances, they viewed either Neutral or Negatively valenced pictures from the International Affective Picture System (IAPS; 2 min 40 s, 5 s per image). Eye tracking verified attention to the visual stream. Mood was assessed at four time points (PT1-PT4) using the Brazilian Mood Scale (BRAMS), and full-body, three-dimensional kinematics were captured at 300 Hz using a 9-camera optoelectronic system (Qualisys) and processed to measure global movement amplitude and expansion. Negative IAPS exposure increased tension, depression, and fatigue, and decreased vigor from PT2 to PT3. Biomechanically, the Negative Stimulus dancers showed a significant reduction in global movement amplitude after negative IAPS exposure, with reduced movement amplitude of the body extremities. In contrast, global movement expansion remained unchanged; that is, the extremities were not positioned closer to or farther from the pelvis. Neutral images produced no mood change and no measurable modulation of movement amplitude or expansion. Together, these results support the hypothesis that improvised dance carries biomechanical signatures of the dancer's current affective state, beyond the intended expressive content, and provide an automated motion-capture workflow for studying emotion-movement coupling in spontaneous dance. Highlights: Negative visual context shifted dancers' mood toward negative affect. Negative images reduced movement amplitude in improvised dance. Movement expansion remained stable despite mood induction.